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FIELD OF THE INVENTION 

The present invention relates generally to alkaloids and alkaloid biosynthesis. In 
particular, the invention pertains to the nucleic acids encoding loline alkaloid synthesis genes and 
the tailoring enzymes of loline alkaloid biosynthesis, and to recombinant vectors and host cells 
containing such genes, and to the recombinant production of alkaloids and uses thereof. 

BACKGROUND OF THE INVENTION 

Loline alkaloids (LA; saturated 1-aminopyrrolizidine alkaloids with an ether bridge, 
Fig.l), are produced in a number of associations of grasses with endophytes of the genus 
Epichloe and their asexual descendants, Neotyphodium spp. In addition, LA are reported from 
the plants Adenocarpus spp. and Argyreia mollis of the families Fabaceae and Convolvulaceae, 
respectively. LA produced in grass-endophyte symbioses have strong insecticidal and feeding- 
deterrent properties (Riedell,et al., 1999, J Entomol. Sci. 26: 122-129; Wilkinson et al. y 2000, 
Mol. Plant-Microbe Interact. 13: 1027-1033). Moreover, grasses infected by LA-producing 
endophytes, such as Neotyphodium coenophialum and N. uncinatum, have greater tolerance to 
drought conditions (Arechavaleta et al., 1989, Agron. J 81: 83-90; Bacon, 1993, Agric. Ecos. 
Environ. 44: 123-141 ) than grasses infected by closely related endophytes, such as N. lolii, that 
do not produce LA (Barker et al., 1997, Agric. Ecos. Environ. 44: 123-141; ; Cheplick et al., 
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2000, Mycol. Res. 97: 1083-1092.). Growth suppression (allelopathy) of neighboring plants by 
meadow fescue (Lolium pratense) infected with Af. may indicate a potential for additional 
beneficial roles of these alkaloids in grass plant competitiveness and persistence. 

LA can accumulate to extremely high levels in grass tissues, occasionally reaching more 
than 2% of the plant's dry mass (Craven et al. 9 2001, Sydowia 53: 44-73). These quantities far 
exceed the biomass of the fungus and the amounts of other alkaloids, such as ergot alkaloids, 
indole-diterpenoids, and peramine, also produced in some of the endophyte-grass symbiota. 
However, despite their exceptional levels in the grass and importance of LA in grass survival, 
little is known about LA biosynthesis. This is in contrast to some of the other endophyte- 
associated alkaloids, such as ergopeptines and indole-diterpenoids, for which much of the 
biosynthetic pathways have been elucidated and key enzymes identified. 

It was previously unknown whether LA are of fungal or plant origin, or produced by both 
symbiotic partners together, but a recent study has established that N. uncinatum can produce LA 
in axenic culture (Blankenship et aL, 2001, Phytochemistry 58: 395-401). This finding presents 
opportunities to identify genes involved in LA biosynthesis. Knowledge of the LA biosynthesis 
genes would allow more detailed studies on the roles of LA in plant persistence, in particular on 
possible contributions to abiotic stress tolerance, as well as the cloning and use of these genes to 
generate genetically engineered plants. 

SUMMARY OF THE INVENTION 

In one aspect, the present invention provides isolated nucleic acid compounds comprising 
at least a sequence identical or complementary to all or part of a coding sequence for the loline 
alkaloid biosynthetic gene cluster from Netotyphodium uncinatum (SEQ ED NO. 15, and SEQ ID 
NO. 16). It appears that SEQ ID NO: 17 may be linked to the 5' end of SEQ ID NO: 16. 
Preferably, a part of said coding sequence is an open reading frame (ORF) selected from the 
group consisting of ORF1, ORF2, ORF3, ORF4, ORF5, ORF6, ORF7, ORF8, ORF9, ORF1', 
ORF2', ORF3\ ORF4', ORF5\ ORF6', ORF7', ORF8', ORF9' or ORF10\ More preferably, a part 
of said coding sequence is an ORF selected from the group consisting of ORF1, ORF2, ORF3, 
ORF4, ORF5, ORF6, ORF7, ORF8, ORF9, ORF1', ORF2\ ORF3', ORF4', ORF5', ORF6', 
ORF7', and ORF8\ 
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In one embodiment, the present invention provides an isolated nucleic acid strand that 
encodes a loline alkaloid gene cluster or subunit thereof comprising a nucleotide sequence 
identical or complementary to, or an amino acid sequence encoded by a nucleotide sequence 
identical or complementary to, all or part of a coding sequence for loline alkaloid biosynthetic 
gene cluster of SEQ ID NO. 15 or SEQ ID NO. 16. Preferably, the gene cluster encodes a 
functional gene cluster and optionally, selected tailoring enzymes. The gene cluster may be 
derived from a single species or may be hybrid in nature. In certain embodiments, the gene 
cluster is a replacement gene cluster. The replacement gene cluster may be a variant, hybrid, 
mutant, analog or derivative thereof. 

In another embodiment, the invention provides an isolated nucleic acid that encodes three 
or more ORFs comprising a sequence identical or complementary to all or part of a coding 
sequence for enzymes performing the biosynthesis of loline alkaloids from Neotyphodium 
uncinatum. Preferably, the ORFs encode a functional gene cluster and optionally, selected 
tailoring enzymes. In certain embodiments, an ORF may be derived from a single species or 
may be hybrid in nature. In certain embodiments at least one of the ORFs is native to the loline 
alkaloid gene cluster of SEQ ID NO. 15 or SEQ ID NO. 16. In certain other embodiments, at 
least one of the ORFs is native to SEQ ID NO: 17. In still other embodiments, at least one ORF 
is derived from a non-loline alkaloid producing Neotyphodium strain, or is hybrid in nature. In 
yet other embodiments, at least one ORF is a variant, mutant, analog or derivative of the native 
coding sequence of SEQ ID NO: 15, SEQ ID NO: 16 or SEQ ID NO: 17. 

In still another embodiment, the present invention provides isolated nucleic acid 
compounds comprising three or more genes of the coding sequence for the biosynthesis of loline 
alkaloids. Preferably, the mixture of genes encode a functional gene cluster and optionally, 
selected tailoring enzymes. In certain embodiments, a gene may be derived from a single species 
or may be hybrid in nature. In certain embodiments at least one gene is derived from a loline 
alkaloid biosynthetic gene cluster. In other embodiments, at least one gene is derived from a 
non-loline alkaloid producing Neotyphodium strain, or is hybrid in nature. Non-limiting 
exemplary non-Neotyphodium biosynthetic genes are preferably subunits of the Neotyphodium 
australiense, Neotyphodium huerfanum, Neotyphodium inebrians, Neotyphodium lolii, and 
Neotyphodium melicicola gene clusters. In yet other embodiments, at least one gene may be a 
variant, mutant, analog or derivative of the native coding sequence of SEQ ED NO: 15, SEQ ED 
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NO: 16 or SEQ ID NO: 17. It is also preferred that the encoded activity of the gene is that of, for 
example and without limitation, an epoxidase, a-type pyridoxal phosphate (PLP) associated 
enzymes, including, by example, class-v aminotransferase, cytochromes P450, aspartate kinase 
allosteric amino acid binding domain, oxidoreductase, ornithine decarboxylase, y-type PLP 
enzyme, FAD-containing monooxygenase, and cyclohexanone oxidase. 

In another aspect, the present invention provides recombinant expression vectors 
encoding a loline alkaloid gene cluster, or variants, hybrids, mutants, analogs or derivatives 
thereof. In certain embodiments, vectors encode one or more subunit of a loline alkaloid gene 
cluster, or variants, hybrids, mutants, analogs or derivatives thereof. 

In another aspect, the present invention provides a host cell transformed with a 
recombinant expression vector described herein. 

In still another aspect, the invention provides a method of preparing loline alkaloid, said 
method comprising introducing a recombinant vector that encodes a loline alkaloid gene cluster 
or subunit thereof into a host cell, culturing said host cell under conditions such that loline 
alkaloid is produced or expressed, and isolating the loline alkaloid from the host cell. In one 
embodiment, the method is practiced with an E. coli host cell. The gene cluster may be a 
replacement gene cluster and preferably a functional gene cluster. In certain embodiments, the 
invention provides methods for preparing new alkaloid-type compounds, preferably, loline-type 
alkaloids. The loline-type alkaloid produced may be loline alkaloid or loline alkaloid variants, 
hybrids, mutants, analogs or derivatives thereof. Such alkaloids are useful as an insecticide. 

These and other embodiments and aspects of the invention will be more fully understood 
after consideration of the attached Drawings and their brief description below, together with the 
detailed description, example, and claims that follow. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows the structures of the loline alkaloids found in certain grass- 

Epichloe/Neotyphodium symbiota. N-Formylloline and N-acetylnorloline were abundant in N. 

uncinatum grown in LA-inducing medium. 

Figure 2 is an autoradiograph showing expression of transcripts isolated in the 

suppression subtractive hybridization in loline-producing (+) and suppressed (-) cultures. In 

each lane was loaded 0.5 jig of total cDNA synthesized from total RNA and probed with 
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subtracted cDNA; molecular sizes indicated (in kilobases) correspond to molecular marker 
(HindlU/EcoRI-cul X.DNA) run on the same gel. 

Figure 3 is an autoradiograph showing expression oflolA and lolC genes in LA- 
producing (+) and suppressed (-) cultures. In each lane was loaded 0.5 |ag of total cDNA 
synthesized from total RNA. cDNAs were probed with a mixture of a labeled 523 by fragment 
from lolA and a labeled 1427 by fragment from lolC. Identities of the hybridizing bands were 
confirmed in separate experiments with the individual probes (data not shown). Bottom panel 
shows expression of the tub2 as a control. Molecular sizes (in kilobases) are indicated, and 
correspond to bands of a DNA-size marker (HindlU/EcoRl-cut ADNA) run in the same gel. 

Figure 4 is a Southern blot of ///nt/III-digested genomic DNAs probed for lolA (panel 
A), lolC (panel B), and tub2 (panel C). Genomic DNAs were from N. lolii 138 (lane 1), E. 
festucae CBS 102477 (lane 2), E.festucae CBS 102475 (lane 3), and N. uncinatum CBS 102646 
(lane 4). Numbers adjacent to each blot indicate band sizes (in kilobases) of the molecular 
marker run in the same gel. For LA phenotype of each species/isolate see Table 3. 

Figure 5 demonstrates the presence of the lolA and lolC genes in endophyte species and 
isolates differing in LA production. Shown are electropherograms with 2 |il of PCR product 
loaded in each lane. The multiplex PCR generated a 523 bp product from lolA and a 461 bp 
product from lolC. The control PCR generated a 726 bp product from tub2. Numbers above 
each lane indicate species or isolate listed under the same number in Table 3; lanes B were PCR 
blanks run without added template DNA; lanes M are molecular size markers (sizes indicated in 
bp). 

Figure 6 illustrates the N. uncinatum lol clusters 1 (LOL1) (upper bar) and 2 (LOL2) 
(lower bar). It appears that the lolF2 allele and lolM are linked to LOL2 . 

DETAILED DESCRIPTION OF THE INVENTION 

Given the valuable agricultural properties of loline alkaloids, there is a need for methods 
and reagents for producing large quantities of loline-type alkaloids, for producing loline-type 
alkaloids in host cells that do not produce loline alkaloids naturally, and for producing novel 
loline-type alkaloids not found in nature. The present invention provides the protein encoding 
nucleic acids and methods that produce loline-type alkaloids, with particular application to 
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methods for producing the loline alkaloids and variants, hybrids, mutants, analogs, derivatives 
and novel compounds related through structure or genetics to loline alkaloid. 

The practice of the present invention will employ, unless otherwise indicated, 

.- - 

conventional methods of chemistry, microbiology, molecular biology and recombinant DNA 
techniques within the skill of the art. Such techniques are explained fully in the literature. See, 
e.g., Maniatis, et al. Molecular Cloning: A Laboratory Manual (Current Edition); DNA Cloning: 
A Practical Approach, vol. I & II (D. Glover, ed.); Oligonucleotide Synthesis (N. Gait, ed., 
Current Edition); Nucleic Acid Hybridization (B. Hames & S. Higgins, eds., Current Edition); 
Transcription and Translation (B. Hames & S. Higgins, eds., Current Edition). 

All publications, patents and patent applications cited herein, whether supra or infra, are 
hereby incorporated by reference in their entirety. 

Definitions 

As used herein, the term "alkaloid-type compound" refers to a compound or molecule that 
is encoded by at least one native alkaloid subunit, or variant, hybrid, mutant, analog, or 
derivative thereof; including for example, without limitation, loline-type alkaloid. 

As used herein, the term "allele" refers to one of two or more alternate forms of a gene 
occupying the same locus in a particular chromosome or linkage structure and differing from 
other alleles of the locus at one or more mutational sites. Non-limiting types of alleles include, 
neutral, amorphs, hypomorphs, hypermorphs, antimorphs, neomorphs, isoalleles and unstable 
alleles. 

As used herein the term "coding sequence" or a sequence which "encodes" a particular 
protein, is a nucleic acid sequence which is transcribed (in the case of DNA) and translated (in 
the case of mRNA) into a polypeptide in vitro or in vivo when placed under the control of 
appropriate regulatory sequences. The boundaries of the coding sequence are determined by a 
start codon at the 5 f (amino) terminus and a translation stop codon at the 3' (carboxy) terminus. 
A coding sequence can include, but is not limited to, cDNA from procaryotic or eucaryotic 
mRNA, genomic DNA sequences from procaryotic or eucaryotic DNA, and even synthetic DNA 
sequences. A transcription termination sequence will usually be located 3' to the coding 
sequence. 
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As used herein the term DNA "control sequences" refers collectively to promoter 
sequences, ribosome binding sites, polyadenylation signals, transcription termination sequences, 
upstream regulatory domains, enhancers, and the like, which collectively provide for the 
transcription and translation of a coding sequence in a host cell. Not all of these control 
sequences need always be present in a recombinant vector so long as the desired gene is capable 
of being transcribed and translated. 

As used herein the term "functional gene cluster" refers to a set of genes (e.g., three or 
more) or subunits of a biosynthesis gene cluster, which catalyzes the synthesis of an active or 
functional alkaloid. 

As used herein the term "gene" refers to a segment of DNA or its complement that is 
involved in producing a polypeptide chain, including regions preceding (leader) and following 
(trailer) the coding sequence as well as intervening sequences (introns) between individual 
coding sequence (exons). A "loline alkaloid gene" refers to at least any of the ORFs of SEQ ID 
NO. 15 and SEQ ID NO. 16. 

As used herein the term "gene cluster" refers to a set of (e.g., three or more) closely 
related genes that code for the same or similar proteins and which are usually grouped together 
on the same chromosome. A "loline alkaloid gene cluster" refers to a set of genes (e.g., three or 
more) encoded by at least any of the ORFs of SEQ ID NO. 15 or SEQ ID NO. 16. 

As used herein the term "genetically engineered host cell" is meant a host cell where the 
native gene cluster or subunits thereof has/have been deleted using recombinant DNA 
techniques. Thus, the term would not encompass mutational events occurring in nature. A "host 
cell" is a cell derived from a procaryotic microorganism or a eucaryotic cell line cultured as a 
unicellular entity, which can be, or has been, used as a recipient for recombinant vectors bearing 
the alkaloid gene clusters of the invention. The term includes the progeny of the original cell 
which has been transfected. It is understood that the progeny of a single parental cell may not 
necessarily be completely identical in morphology or in genomic or total DNA complement to 
the original parent, due to accidental or deliberate mutation. Progeny of the parental cell, which 
are sufficiently similar to the parent to be characterized by the relevant property, such as the 
presence of a nucleotide sequence encoding desired biosynthetic enzymes, are included in the 
definition, and are covered by the above terms. 
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As used herein the term "loline alkaloid analog" or "analog" refers to a compound or 
molecule that resembles a loline alkaloid and that contains one or more structural differences 

relative to the loline alkaloid. Preferably, the loline analog has a desired activity of loline 

.* - 

alkaloid although a loline analog may have enhanced or the same activity than products of the 
loline alkaloid gene cluster. For example, the degree of saturation of at least one bond in the 
loline alkaloid structure can be changed (e.g., a single bond can be changed to a double or triple 
bond, or the converse), a bond can be removed, one or more carbon, oxygen or hydrogen atoms 
can be replaced with a different atom or a chemical moiety (e.g., a halogen, oxygen, nitrogen, 
sulfur, hydroxy, methoxy, alkyl, aryl, cycloalkyl, heterocycle, amine, amide, ketone, aldehyde, 
etc.) and the like. Also other peripheral groups, such as OH groups, methyl groups, O-methyl 
groups, halogene atoms etc. can be added, modified or removed. Other types of derivatives of 
loline that would be encompassed by the term "loline alkaloid analog" are known in the art. 
Non-limiting examples are norloline, N-methylloline, N-formylloline, N-formylnorloline, N- 
acetylloline and N-acetylnorloline. 

As used herein the term "loline alkaloid derivative" or "derivative" refers to a compound 
or molecule, that may be produced from loline in one or more steps or with few chemical or 
moiety modifications. 

As used herein the term "loline-type alkaloid" refers to a compound or molecule that is 
encoded by one or more native gene of, or a variant, hybrid, mutant, analog or derivative thereof, 
at least SEQ ID NO. 15 or SEQ ID NO. 16. 

As used herein, the term "modification enzyme" or "tailoring enzyme" refers to a protein 
or enzyme that is involved in modifying an alkaloid after its core has been synthesized by the 
necessary components to catalyze the production of an active or functional alkaloid. Exemplary, 
modification enzymes involved in loline-type alkaloid synthesis include, without limitation, 
oxidoreductases, dioxygenases and N-methyltransferase. 

As used herein, the term "modification step" or "tailoring step" refers to an action or 
actions taken by a protein or enzyme to modify an alkaloid after its core has been synthesized by 
the necessary components to catalyze the production of an active or functional alkaloid. 

As used herein the term "mutant" refers to a nucleic acid compound, protein, molecule, 
vector or cell resulting from mutation of the native wild type coding sequence or subunits 
thereof 
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As used herein the term "mutation" refers to any change that alters a native coding 
sequence either by displacement, addition, deletion, insertion, cross-linking, or other destruction 
or substitution of one or more nucleotides of the native coding sequence. Techniques for 
modifying nucleotide sfe^uences, such as site-directed mutagenesis, are also known in to those 
skilled in the art. 

As used herein the term "nucleic acid" sequence can include, but is not limited to, 
procaryotic sequences, eucaryotic mRNA, cDNA from eucaryotic mRNA, genomic DNA 
sequences from eucaryotic (e.g., mammalian) DNA, and even synthetic DNA sequences, and 
complements thereof. The term also captures sequences that include any of the known base 
analogs of DNA and RNA such as, but not limited to 4-acetylcytosine, 8-hydroxy-N6- 
methyladenosine, aziridinylcytosine, pseudoisocytosine, 5-(carboxyhydroxylmethyl) uracil, 5- 
fluorouracil, 5-bromouracil, 5-carboxymethylaminomethyl-2-thiouracil, 5- 
carboxymethylaminomethyluracil, dihydrouracil, inosine, N6-isopentenyladenine, 1- 
methyladenine, 1-methylpseudo-uracil, 1-methylguanine, 1-methylinosine, 2,2-dimethyl- 
guanine, 2-methyladenine, 2-methylguanine, 3-methyl-cytosine, 5-methylcytosine, N6- 
methyladenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxy-aminomethyl-2- 
thiouracil, beta-D-mannosylqueosine, S'-methoxycarbonylmethyluracil, 5-methoxyuracil, 2- 
methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid, 
oxybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4- 
thiouracil, 5-methyluracil, N-uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid, 
pseudouracil, queosine, 2-thiocytosine, and 2,6-diaminopurine. A transcription termination 
sequence will usually be located 3' to the coding sequence. 

As used here the term "open reading frame" or "ORF" refers to a region of a nucleic acid 
molecule that contains a series of triplet bases coding for amino acids without any termination 
codons. An "open reading frame" includes any start codons. 

As used herein the term "replacement gene cluster" is meant any set of genes (e.g., three 
or more), optionally including genes encoding modification or tailoring enzymes, capable of 
producing a functional gene cluster when under the direction of one or more compatible control 
elements, as defined above, in a host cell transformed therewith. The term "replacement gene 
cluster" encompasses three or more genes encoding for the various proteins necessary to catalyze 
the production of an alkaloid. A "replacement gene cluster" need not include all of the genes 
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found in the corresponding cluster in nature. Rather, the gene cluster need only encode, but is 
not limited to, the necessary components to catalyze the production of an active alkaloid. For 

example, if the gene cluster includes, for example, eight genes in its native state and only three of 

.- - 

these genes are necessary to provide an active alkaloid, only these three genes need be present, 
and a variety of the non-necessary genes may optionally be present. The term, "replacement gene 
cluster" may also contain genes coding for modification or tailoring enzymes or tailoring 
enzymes to the core alkaloid catalyzed by the necessary components to catalyze the production 
of an active or functional alkaloid. Furthermore, a replacement gene cluster can include genes 
derived from a single species, or may be hybrid in nature with, e.g., a gene derived from a cluster 
for the synthesis of a particular alkaloid replaced with a corresponding gene from a cluster for 
the synthesis of another alkaloid. Hybrid clusters can include genes derived from different 
species. The genes included in the replacement gene cluster need not be the native genes, but 
can be variants, mutants or analogs thereof. Variants are prepared by methods known in the art 
(see Maniatis et al. Molecular Cloning: A Laboratory Manual (Current Edition)). Mutants or 
analogs may be prepared by the deletion, insertion or substitution of one or more nucleotides of 
the coding sequence. Techniques for modifying nucleotide sequences, such as site-directed 
mutagenesis, are described in the literature. The genes included in the replacement gene cluster 
need not be on the same plasmid or if present on the same plasmid, can be controlled by the same 
or different control sequences. 

As used herein, the term "subunit" refers to a part of a gene cluster including, for 
example, a module, domain, gene, or open reading frame, and parts thereof. A "subunit" may 
comprise for example, a gene or genes derived from a single species or may be hybrid in nature 
(e.g., a gene derived from a cluster for the synthesis of a particular alkaloid replaced with a 
corresponding gene from a cluster for the synthesis of another alkaloid.). A "subunit" may 
comprise variants, mutants, analogs or derivatives of the native gene(s). Variants, mutants, 
analogs or derivatives thereof may be prepared by techniques known to those of skill in the art, 
including, without limitation, the displacement, addition, deletion, insertion, cross-linking, or 
other destruction or substitution of one or more nucleotides of the coding sequence. Techniques 
for modifying nucleotide sequences, such as site-directed mutagenesis, are also known in to 
those skilled in the art. 
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As used herein the term "loline alkaloid variant" or "variant" refers to a nucleic acid 
sequence that hybridizes to an isolated nucleic acid sequence under high stringency conditions 

and has a desired or enhanced activity of the complement. Variants may include alleles, mutants, 

.♦ - 

hybrids, derivatives, or analogs. Variants also include the polypeptides coded for by these 
hybridizable nucleic acids. 

Identification of lolA and lolC 

Production of LA in N. uncinatum can be regulated by culture conditions, such as carbon 
and nitrogen source and pH in the culture medium, and is completely suppressed in a complex 
medium (Blankenship et al. y 2001, Phytochemistry 58: 395-401), suggesting differential 
expression of genes involved in LA biosynthesis. Isolation of the genes up-regulated during LA 
production is a first step in identifying possible enzymes in the biosynthesis of the LA. Different 
methods are now available for the isolation of differentially expressed genes (Diatchenko et al. 9 
1996, Proc. Natl. Acad. Sci. USA 93: 6025-6030; Liang and Pardee, 1992, Science 257: 967- 
971), incorporated herein. One such method, suppression subtractive hybridization (Diatchenko 
et al. 9 1996; Diatchenko et al. 9 1999, Meth. EnzymoL 303: 349-380) (incorporated herein), has 
been particularly useful for identifying differentially expressed genes. This technique was used 
herein to identify genes up-regulated in N. uncinatum during LA production. 

Culture conditions inducing or suppressing LA alkaloid accumulation in the fungus N 
uncinatum (Blankenship et aL, 2001) were used in combination with suppression subtractive 
hybridization for isolation of gene transcripts that are up-regulated during LA production. This 
approach was highly effective in enriching cDNAs differentially expressed in LA-producing 
cultures: subtracted cDNAs hybridized much more strongly with cDNAs from LA-producing 
cultures than with cDNAs from LA-suppressed cultures (See Figure 2). However, a few weak 
hybridizing bands were present in the total cDNA from the suppressed cultures, which was 
expected because very low LA levels accumulated in the suppressed cultures. Success of the 
approach was further indicated by the identification of lolA and lolC, genes that were present 
only in the species and isolates that produced LA, and, in the case of lolC, related to a sequence 
previously found to cosegregate with the LA-producing phenotype in Mendelian analysis of E. 
festucae (Wilkinson et aL, 2000, Mol. Plant-Microbe Interact. 13: 1027-1033). The 
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relationships of lolA and lolC to known biosynthetic enzymes further suggested that this 
approach identified transcripts of LA biosynthesis genes. 

The subtracted cDNAs comprised approximately 6-7% of all transcripts present in LA- 
producing cultures. This estimation appears reliable, since the number of independent clones in 
the cDNA library from LA-producing cultures (4.1 x 10 6 pfu) greatly exceeds the number of 
clones (1.0 x 10 6 pfu) estimated to be required for a library representing the complexity of the 
original mRNA population (Ausubel et a/., 2001). 

A relatively small number of the subtracted cDNAs and cDNA library clones were 
sequenced. Rather than conducting extensive sequencing, we focused on some of the cDNAs 
sequenced in this smaller survey, like lolA and lolC, giving significant similarity to known genes 
in amino acid biosynthesis/conversion, to further test their association with LA production. 
These cDNAs appeared promising candidates, since it has been hypothesized earlier that LA 
have polyamines as precursors, which in turn are products from amino acid metabolism. 

Among cDNAs isolated by the subtraction five independent clones from two alleles of 
genes designated lolA were identified. However, N. uncinatum has at least two copies oilolA . 
and lolC. The lolA alleles encode predicted proteins significantly similar to aspartate kinases, the 
first step in biosynthesis of methionine, threonine, and isoleucine from aspartate. In addition, 
one cDNA clone of a gene, lolC, with similarity to fungal enzymes in methionine biosynthesis 
was identified. Expression of lolA and lolC was clearly up-regulated in the LA-producing 
cultures compared to expression in the suppressed cultures. Further evidence for involvement of 
lolA and lolC in LA production was the distribution of these genes among the Neotyphodium and 
Epichloe species surveyed, of which eight species produce LA, 12 do not, and one (E.festucae) 
is polymorphic for this phenotype. Restriction of lolA and lolC to LA-producing endophytes 
indicated that both genes are either involved or physically linked to genes involved in the LA 
production phenotype. This observation, coupled with the observed up-regulation of lolA and 
lolC in the LA-producing cultures, lent support to an involvement of both genes in LA 
production. 

Generation of knock-outs of lolA and/or lolC will provide further evidence of their roles 
in LA production. However, preliminary evidence indicates that N. uncinatum has at least two 
alleles of lolA and the possibility of more than one allele of lolC. Thus, different approaches 
will be necessary to generate complete knock-outs, one of which could be disruption of the 
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putative lol genes in N. coenophialum for which procedures for knock-outs and double knock- 
outs have recently been developed. 

The ORFs of the lolA alleles in N. uncinatum predicted proteins with lengths of 

.* - 

approximately 210 amino acids, much shorter than the sizes of known aspartate kinases (for 
example, aspartate kinase of Sc. pombe, GenBank accession T39822, has a length of 5 19 amino 
acids). Potential reasons for this disparity could include truncation in the RT-PCR due to 
incomplete extension by the reverse transcriptase, or incorrect annealing of the 5'and 3 'end- 
specific cDNA primers to internal gene sequences. cDNA-based northern analysis (see Fig. 3), 
indicated a strong band of the expected size for an mRNA encoding 210 amino acids, whereas 
incomplete extension would probably have resulted in multiple bands or smear in the total 
cDNA. Moreover, despite being very close in size, the two allelic lolA cDNAs varied in the 
lengths of their 5' and 3' terminal sequences (not shown). Because of this difference, truncation 
due to incorrect primer annealing also appears unlikely, leaving the possibility that the lolA gene 
encodes a protein much shorter than known aspartate kinases. The predicted lolA amino acid 
sequences have similarity only to the C-terminal region of aspartate kinase, but not to the N- 
terminal regions, containing regions for substrate affinity and the active center (Arevalo- 
Rodriguez et al, 1999). A search of the PROSITE database further indicated that the predicted 
lolA sequences do not have an N-terminal consensus sequence typical of aspartate kinases. The 
C-terminal region of aspartate kinases, to which the predicted lolA products have similarity, is 
thought to be involved in allosteric response of the enzyme (Arevalo-Rodriguez et ah, 1999). It 
is thus possible that the predicted lolA proteins may have a binding site for an allosteric 
modulator similar to the modulators acting on aspartate kinase, which are normally allosterically 
regulated by the amino acids lysine, threonine, or isoleucine. 

Multiple steps have been identified for the biosynthesis of the more common plant 
pyrrolizidines, the senecio alkaloids. Senecio alkaloids are synthesized from polyamines, such 
as putrescine (derived from decarboxylated ornithine) and spermidine. In part because of their 
structural similarities with senecio alkaloids, a pathway from polyamines has been proposed for 
LA (Bush et a/., 1993). Relative positions of carbon and nitrogen atoms in the 1- 
aminopyrrolizidine structure (see Fig. 1) would be consistent with spermidine or a related 
compound as precursor, and spermidine is ultimately derived from the amino acids ornithine and 
methionine. Aspartate kinase and homocysteine synthase (or related enzymes) are steps in 
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biosynthesis of methionine, which in turn is a precursor to decarboxylated S- 
adenosylmethionine, the source of the aminopropyl moiety of spermidine. The association of 
lolA and lolC with LA production indeed suggests possible LA-biosynthesis from aspartate via 
methionine. However, the substantial differences between predicted lolA and known aspartate 
kinases may cast doubt on this possibility. Nevertheless, we have observed specific 
incorporation of 4-[ l3 C]-Asp into LA, indicating that aspartate is a precursor, although the exact 
sequence of biosynthetic steps remains to be established. Moreover, lolC also had similarity to 
an enzyme in the biosynthesis of rhizobitoxine, a bacterial product which enhances nodulation. 
The activity of this enzyme encompasses formation of serinol and dihydrorhizobitoxine 
biosynthesis, thus synthesis of compounds different from methionine precursors, further 
indicating that LA biosynthesis could differ from common amino acid and/or polyamine 
biosynthesis. 

Another cDNA obtained with the subtraction had similarity to a putative zinc-finger 
transcription factor. Interestingly, in fungi such as Fusarium sporotrichioides and Em, nidulans, 
transcriptional regulators can be part of secondary metabolite pathway clusters, raising the 
possibility that a specific transcriptional regulator also exists for LA genes. The probable 
transcription factor found here has similarities to C2H2 zinc-finger transcription factors. A 
C2H2-like transcription factor was found to be involved in the control of genes in the 
biosynthesis of trichothecene, a secondary metabolite produced by F. sporotrichioides. In our 
study, however, detection of the C2H2-like gene did not correlate with LA production in 
endophytes. Therefore, it likely that this putative transcription factor might be specifically 
expressed in N. uncinatum under the culture condition used to induce LA production, but may 
not be a specific regulator for LA biosynthesis genes. Another possibility, however, is that this 
factor regulates LA genes in N. uncinatum, but different factors regulate the orthologous genes in 
other endophyte species. In fact, loline alkaloids are not produced by other endophyte species in 
these culture conditions despite the presence of lolA and lolC, and despite their production of LA 
when symbiotic with plants. Therefore, the possibility of a unique regulator of LA synthesis in 
N. uncinatum warrants further investigation. 

Other genes up-regulated during LA production that gave significant matches with 
known genes or sequences were a putative homing endonuclease, generally associated with 
unusual DNA splicing and incorporation events, and significant matches of cDNAs to sequences 
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in the Ns. crassa genome. However, for none of these genes do we currently have direct 
evidence for involvement in LA production. One sequence identified in four clones (P3, K8, 
C37, D5) was also detectable (by Southern blot) in at least one non-producer, E.festucae CBS 
102477, but not in the LA producers N. coenophialum ATCC 90664 and E.festucae CBS 
102475 (data not shown), suggesting that this gene is not involved in LA production. 

Further investigations into biosynthesis of the loline alkaloids. 

Very little is known about the regulation of secondary metabolism in grass endophytes 
and many other fungi. The approach used here is a crucial step towards elucidating the 
biosynthesis of LA, allowing the isolation of genomic copies of N. uncinatum genes closely 
associated with LA production. Secondary metabolite pathway genes are frequently clustered in 
fungal genomes (Keller and Hohn, 1997, Fung. Genet Biol 21: 17-29; Seo et al. 9 2001, Fung. 
Genet. Biol. 34: 155-165;; Tudzynski et al, 1999, Mol Gen. Genet. 261: 133-141). The finding 
of genes associated with LA production now permits investigations of potential clustering of LA : 
genes in genomes of LA-producing endophytes. 

Example 1 

All chemicals (including antibiotics) and reagents used in the experiments described in 
the examples below were obtained from Sigma Corp. (St. Louis, MO, USA), unless indicated 
otherwise. Growth media were from Difco Laboratories (Detroit, MI, USA). Agarose for DNA 
and RNA gel electrophoreses was supplied by BioWhittaker Molecular Applications (Rockland, 
ME, USA). For routine PCR of templates <1.0 kb, AmpliTaq Gold (Applied Biosystems, Foster 
City, CA, USA) was used. PCRs for cDNA synthesis, suppression subtractive hybridization, and 
templates >1.0 kb were performed with the Advantage cDNA PCR Kit (Clontech, Palo Alto, 
CA, USA). 

Fungal cultures and analyses of loline alkaloids. 

Mycelium of Neotyphodium uncinatum (voucher specimen CBS 102646 at 
Centraalbureau Voor Schimmelcultures, Utrecht, The Netherlands) was isolated from grass leaf 
tissues [meadow fescue (Lolium pratense = Festuca pratensis), plant 167 in our plant collection] 
on potato dextrose agar as previously described (Blankenship et at, 2001). The following 
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procedures were carried out as described by Blankenship et al (2001) with modifications. After 
21 days of growth at 22 °C on PDA plates, 10 fungal colonies were transferred to, and 
homogenized in, 20 ml of LA-inducing medium (Blankenship et a/., 2001) with 15 mM 
asparagine and 20 mM sucrose as the nitrogen and carbon sources, respectively. Ten ml of the 
homogenate was added to a 500-ml Erlenmeyer flask with 100 ml of fresh LA-inducing medium, 
and the culture incubated at 22 °C with rotary shaking (100 rpm). After five days of growth, 
mycelium was harvested in 50-ml tubes (Falcon, distributed by Becton Dickinson Labware, 
Lincoln Park, NJ, USA) by centrifugation (2000 x g rcf), and the mycelium homogenized in 20 
ml LA-inducing medium as described. To initiate main cultures for LA production, 1 ml of 
homogenized mycelium was added to 25 ml of LA-inducing medium and cultures were 
incubated as described above. To suppress LA production in cultures, but maintain growth 
conditions similar to the minimal medium, potato dextrose broth was added to give half-strength 
final concentration in the medium, and asparagine and sucrose were added to 7.5 mM and 10 
mM final concentration, respectively. Except for this variation in medium composition, all 
growth conditions and source of inoculum for LA-suppressed cultures were the same as for LA- 
induced cultures. Cultures of N. uncinatum were grown under the conditions inducing or 
suppressing LA accumulation, and harvested during early accumulation when LA levels in the 
producing medium were <20 jig ml" 1 . (Levels in similar cultures later reached >1000 (ig ml* 1 in 
producing, but <10 \ig mr" 1 in suppressed cultures.) 

LA extraction from freeze-dried culture filtrates or plant tissues, and quantitation by gas 
chromatography (GC) analysis, were performed as described by Blankenship et al. (2001). 

RNA extraction, DNase treatment, and analysis. 

Mycelium was harvested by vacuum filtration through Whatman No. 1 filter paper 
(Whatman International Ltd, Maidstone, England, UK) and total RNA was extracted from 0.2 - 
0.3 g (fresh weight) mycelium with the RNeasy Plant Minikit (Qiagen Inc, Valencia, CA, USA). 
Co-purified DNA was removed with the DNA-free™ kit (Ambion, Austin, TX, USA) by treating 
the extracts (50 ^1) with 2 units of DNasel for 30 min at 37 °C, whereupon DNase activity was 
stopped with DNase Inactivation Reagent (Ambion). Purified RNA was quantified by measuring 
absorbance at 260 nm and 280 nm in a Genequant spectrophotometer (Amersham Pharmacia 
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Biotech, Piscataway, NJ, USA). Integrity of the total RNA was checked by electrophoresis in 
1 .2% formaldehyde agarose gels. 

cDNA synthesis and suppression subtractive hybridization. 

Total RNA was extracted from LA-producing and LA-suppressed cultures. However, 
low mycelial biomass resulted in low RNA yields. To obtain enough cDNA for subtractive 
hybridization and expression analysis (cDNA-based Northerns; Endege et a/., 1999, 
BioTechniques 26: 542-548), cDNA was synthesized and amplified with the SMART™ PCR 
cDNA Synthesis Kit (Clontech). Three |il of RNA solution (300 ng/jil) was reverse transcribed 
with Superscript™ II following the instructions of the manufacturer (Gibco BRL, Grand Island, 
NY, USA). The reverse-transcription reaction was diluted with TE buffer to a total volume of 50 
\xl. Amplification of cDNA by long-distance PCR was carried out according to the protocol of 
the SMART™ PCR cDNA Synthesis Kit (Clontech) in a GeneAmp PCR System 2400 
thermocycler (Perkin Elmer Inc., Boston, MA). One ^il of the diluted reverse-transcription 
reaction was used, and the number of PCR cycles required for optimum amplification of cDNA 
was determined according to the manufacturer's protocol (Clontech). The amplification step 
allows bulking up on cDNA, while likely maintaining the complexity of the original RNA 
population. 

Suppression subtractive hybridization (Diatchenko et al., 1996; Diatchenko et al. 9 1999) 
was performed with the PCR-Select™ cDNA Subtraction Kit (Clontech) essentially as described 
in the Clontech PCR-Select™ manual. The PCR-Select procedure consists of Rsal digestion of 
cDNA, ligation of digested tester DNA (containing differentially expressed genes of interest) to 
two adaptors (1 and 2R, specified in the manual), and two rounds of hybridization with driver 
DNA used to subtract out cDNAs not differentially expressed in the tester, followed by 
amplification of the subtracted cDNA by PCR with primers specific to the adaptors. Primary 
PCR is followed by secondary PCR with nested primers. Only DNA fragments carrying 
different adaptors at each end tend to amplify exponentially. 

cDNA previously amplified with the cDNA Synthesis Kit was purified and digested with 
Rsal. The digested cDNA was cleaned up with the PCR Purification Kit (Qiagen), eluted into 50 
of elution buffer, and ethanol precipitated, and adaptors ligated to the tester DNA. In the first 
hybridization, 13 ng of adaptor-ligated tester was mixed with 147 ng of driver in two separate 
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reactions (each reaction with adaptor 1 and 2R, respectively) and, after denaturation (98 °C for 
1 .5 min), were allowed to anneal for 9 hr at 68 °C. After this first hybridization, the two 
reactions were combined in the presence of 98 ng of denatured fresh driver and a second 
hybridization performed" for 16 hr at 68 °C. Amplification of tester-tester hybrids was performed 
as described in PCR Purification Kit manual. Efficiency of the ligation to the adaptors and of the 
subtraction was tested and confirmed as called for in the protocol, using two primers (5'- 
GTTGATCTCC AAGATCCGTGAGG-3 ' (SEQ ID NO. 1) and 5'- 
GTTTCGTCCGAGTTCTCGAC-3') (SEQ ID NO. 2) specific to the p-tubulin gene (tub!). 

Upon completion of suppression subtractive PCR, a portion of the product mixture was 
ligated into pCR®4Blunt-TOPO®, using the Zero Blunt TOPO PCR Cloning Kit (Invitrogen, 
Carlsbad, CA, USA), and electroporated into TOP 10 cells provided with the kit, to obtain a 
subtracted expressed sequence tag (EST) bank. Another portion was used to generate 
hybridization probe PCR-labeled with digoxigenin (DIG) following the protocol of the 
manufacturer (Roche-Boehringer, Indianapolis, IN, USA). 

cDNA library construction. 

cDNA synthesis and library construction were performed with the SMART™ cDNA 
Library Construction Kit (Clontech) according to the manufacturer's instructions. First-strand 
cDNA was synthesized with the same amount of RNA as used in the cDNA synthesis for the 
subtraction, and 2 /A undiluted first-strand reaction was used as template to amplify the cDNA. 
The amplified cDNA was digested with 5/zI, size fractionated for removal of low-molecular-size 
(<0.1 kb) cDNA, and ligated into ATriplEx2 vector (Clontech). cDNA ligated into vector was 
added to X phage Gigapack III Gold packaging extract (Stratagene, La Jolla, CA, USA), and 
titered in E. coli strain XL 1 -Blue as specified by the manufacturer. For cDNA library 
amplification, overnight cultures of XL 1 -Blue were inoculated with an amount of packaged 
phage suspension to yield 1.0 x 10 5 pfu per 150 mm plate (Falcon); in total, 15 plates were 
prepared, so the amplified library was derived from 1.5 x 10 6 primary clones. After incubation 
overnight at 37 °C, to each plate was added 12 ml of X dilution buffer (100 mM NaCl, 10 MM 
MgS0 4 , 35 mM Tris-HCl, pH 7.5, 0.01% gelatin), followed by 20 hr incubation at 4 °C. The 
phage suspensions were then titered for each plate. Since differences in titer between plates 
would affect representation of cDNA clones in the final amplified library, the appropriate 
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volume of each suspension was determined so that, when combined, each plate contributed 
equally to the total number of pfu in the pooled library. After pooling, the titer of the amplified 
library was 5.4xl0 9 pfu ml' 1 . 

Southern blot and PCR analysis of genomic DNA. 

Fungal genomic DNA was isolated by the method of Al-Samarrai & Schmid, 2000, Lett. AppL 
Microbiol 30: 53-56. Because Neotyphodium occultans does not grow autonomously in culture, 
DNA from the Lolium multiflorum-N. occultans symbiotum was isolated by the method of Doyle 
and Doyle, 1990, Focus 12: 13-15 for PCR analysis. 

Probes for Southern-blot, dot-blot and cDNA-based northern-blot hybridizations were 
labeled with DIG as described above. Total subtracted cDNA was labeled by using the primary 
PCR product in the subtraction as template and the nested PCR primers supplied with the PCR- 
Select™ cDNA Subtraction Kit (Clontech). Probe for lolA was a labeled 523 by fragment 
generated by PCR using primers lolA-5' (S'-GTCTGGCGAATTCTACAGAC ACG-3 ') (SEQ ID 
NO. 3) and lolA-3' (5 '-GATGGCCATGTGAGGAAAGAG-3 ') (SEQ ID NO. 4). A labeled 
1427 by fragment of the lolC gene (see Results) was generated by PCR with primers lolC-5' (5'- 
CGGTGCGCGTCTTCTAAACTTGAC-3 ') (SEQ ID NO. 5) and lolC-3' (5'- 
GAATCTTTCCGATGC AAGGCTT ACG-3 ') (SEQ ID NO. 6). 

cDNA-based northerns were performed with complete cDNA, which was gel fractionated 
and Southern blotted to Hybond™-N+ nylon membranes (Amersham Pharmacia Biotech). 
Southern blotting of DNA by alkaline transfer, as well as dot blotting onto Hybond™-N+ nylon 
membranes (Amersham Pharmacia Biotech) and DNA hybridizations were accomplished with 
standard protocols (Ausubel et aL, 2001). Membranes were washed with O.lx SSC, 0.1% SDS, 
once for 15 min at room. temperature, then for 20 min and again for 30 min at 75 °C (membranes 
with cDNA) or at 65 °C (membranes with genomic DNA). Chemiluminescent detection of 
probes hybridized to DNA with anti-DIG antibodies was performed according to the protocol of 
the supplier (Roche-Boehringer). To visualize labeled probes hybridizing to DNA, membranes 
were exposed to Hyperfilm™ ECL™ Chemiluminescence film (Amersham Pharmacia Biotech). 

PCR screening for lolA was performed on endophyte genomic DNA with primers lolA-3' 
and lolA-5'. PCR screening for lolC employed primers lolC-3'a (5'- 
GGTCTAGTATTACGTTGCCAGGG-3 ') (SEQ ID NO. 7) and lolC-5'a (5'- 
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GTTGCCCACGGTGCGCGTCTTC-3 ') (SEQ ID NO. 8). PCR was performed with 35 cycles 
of 95 °C for 30 s, 62 °C for 30 s, and 72 °C for 1 min. As a positive control for DNA integrity in 
this screening, a tub2 gene fragment was amplified by PCR with primers 5'- 

r m 

TGGTC AACC AGCTC AGC ACC-3 ' (SEQ ID NO. 9) and 5 ' -G AG AAAATGCGTG AG ATTGT- 
V (SEQ ID NO. 10) (Byrd et ai, 1990), with 35 cycles of 95 °C for 30 s, 55 °C for 30 s, and 72 
°C for 1 min. 

cDNA library screening and conversion of phage to plasmid clones. 

Screening of the cDNA library was essentially as described by Ausubel et al (2001). 
Phage were plated in a lawn of £*. coli XL 1 -Blue, and phage lifts were on Hybond™-N+ nylon 
membranes (Amersham Pharmacia Biotech). To convert clones in ATriplEx2 to plasmid form, 
plaques were added to E. coli strain BM25.8 (which expresses Cre-recombinase) as per the 
supplier's protocols (Clontech). Single, isolated colonies were selected on LB agar with 
ampicillin, picked and grown in LB with ampicillin, and plasmids isolated by a rapid alkaline 
procedure (Ahn et aL, 2000, BioTechniques 29: 266-368). To verify that a plasmid carried the 
desired insert, 3 ^1 of each plasmid was spotted onto a nylon membrane for dot blotting, and the 
membrane hybridized to the probe initially used to identify the X-phage clone. 

Plasmid DNA isolation, sequencing, and database search of cDNAs. 

Plasmid DNA was isolated from bacterial cells by the rapid alkaline miniprep procedure 
(Ahn et al. 9 2000). Plasmid inserts were sequenced with primers L-triplEx 5' (5'- 
TCCGAGATCTGGACGAGC-3 ') (SEQ ID NO. 11) and L-triplEx 3* (5'- 
TAATACGACTCACTATAGGG-3 ') (SEQ ID NO. 12), specific to vector regions flanking the 
cDNA inserts. DNA cloned into TOPO vector (Invitrogen) was sequenced with M13-reverse 
(5 '-C AGG AAAC AGCTATGAC-3 *) (SEQ ID NO. 13) and M13-forward (5'- 
GTAAAACGACGGCC AG-3 ') (SEQ ID NO. 14) primers. Sequencing of DNA was performed 
with the BigDye Terminator Cycle Sequence Kit (Applied Biosystems) on an ABI 310 
automated sequencer (Applied Biosystems) or, for high-throughput sequencing, the CEQ2000XL 
DNA Analysis System (Beckman-Coulter, Fullerton, CA, USA) with the CEQ™ DTCS - Quick 
Start Kit (Beckman-Coulter). DNA sequences obtained were entered into the basic local 
alignment search tool (BLAST; Altschul et aL, 1997) programs at the National Center for 
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Biotechnology Information site (NCBI; http://www.ncbi.nlm.nih.gov/BLAST/) to search the 
nonredundant nucleic acid (nr) database, and at the Whitehead Institute site (http://www- 
genome.wi.mit.edu/ annotation/fungi/neurospora/) to search the Neurospora crassa database for 
similar sequences. Matches with known DNA/protein sequences in these databases were 
considered significant at E < 10" 4 . Predicted protein sequences were analyzed for occurrence of 
biologically significant sites by searching the database of protein families and domains 
(PROSITE) at ExPASy (Expert Protein Analysis System; http://ca.expasy.org/). 

Example 2 

Transcripts up-regulated in loline alkaloid-producing cultures. 

To gauge the success of the suppression subtractive hybridization, and to get an overview 
of the number and sizes of cDNAs potentially enriched by the subtraction, cDNA-based northern 
analysis on total cDNA from the cultures were conducted (Figure 2). The subtracted cDNAs 
hybridized much more strongly with total cDNA from the LA-producing cultures than to total 
cDNA from the suppressed cultures, demonstrating enrichment of cDNAs up-regulated in the 
LA-producing cultures. Multiple sizes of cDNAs hybridized with the subtracted cDNAs, 
indicating up-regulated expression of several different genes in the LA-producing cultures. 

The XTriplEx2 library created with complete cDNA from LA-producing cultures 
contained a total of 4.1 x 10 6 primary pfu. From this, 1.5 x 10 6 primary pfu, likely representing 
the complete cDNA population obtained from the LA-producing cultures, were amplified to 5.4 
x 10 9 pfu ml' 1 . 

The number of recombinant clones (= likely containing cDNA inserts) in the unamplified 
library was assessed by blue-white screening of the plated library at 3.3 x 10 6 pfu (>80% of all 
primary library clones). To determine the percentage of library clones that contained transcripts 
up-regulated in the LA-producing cultures, phage lifts were probed with total subtracted cDNA. 
Approximately 5.3% of all clones (about 6.6% of the presumed recombinant clones) hybridized 
with the subtracted cDNA (data not shown). 

Example 3 

Several subtracted cDNA similarities with database sequences. 
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Twenty clones from the subtracted EST bank, and six library clones hybridizing to total 
subtracted cDNA were sequenced. In the suppression subtractive method incomplete 
suppression can result from amplification of tester-tester hybrids which have only one of the two 
adaptors at each end. However, all ESTs sequenced from the bank of subtracted cDNAs had the 
two different adaptors at their ends, indicating that contaminating background due to non- 
specific amplification of tester was very low. 

Sequences of subtracted cDNA clones, as well as inserts in library clones that hybridized 
to subtracted cDNAs, were used to query databases by various BLAST algorithms. For several 
subtracted cDNAs (Table 1) and cDNAs from the ^-library (Table 2), matches to known protein 
sequences in NCBI, or sequences in the Ns. crassa database were identified. Five library and 
subtracted clones had significant similarity to the C-terminal amino acid regions of aspartate 
kinases (Tables 1 & 2). Two of the library clones with similarity to aspartate kinase were 
identical to each other in sequence (P2 and PI 6). However, a third library clone (PI 7) differed 
from the other two (94% identity), but had 100% sequence identity with two subtracted clones 
(B8 and C5). The detected variation in sequence among the clones suggested more than one 
form of this gene in N. uncinatum. The presence of two genomic alleles of the AspK-related 
gene in N. uncinatum was verified by PCR with primers with allele-specific nucleotides at their 
3'-ends (data not shown). Because other results of this study (described below) strongly 
associate these sequences with LA production, we will hereafter designate the corresponding 
genes as lolA alleles; the two allelic sequences have been submitted to GenBank (accessions 
AF439396 and AF439395). 

The predicted proteins (209 and 210 aa) encoded by lolA alleles were smaller than known 
aspartate kinases (which usually exceed 500 aa) and had similarity only to the C-terminal regions 
of aspartate kinases. This was indicated by protein sequence alignments with known aspartate 
kinases that gave the most significant matches in BLAST. These proteins, from 
Schizosaccharomyces pombe and Saccharomyces cerevisiae (GenBank accessions T39822 and 
PI 0869), aligned with a region starting at amino acid position 47 and ending at amino acid 204 
of the predicted lolA proteins and had 25-26% identity to lolA, but only within a region of the 
known aspartate kinases starting at about amino acid 351 and ending at amino acid 495. 
PROSITE searches with the predicted amino acid sequences of the two lolA alleles indicated that 
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both lacked the aspartate kinase signature, defined by PROSITE as [LIVM|-x-K-[FY]-G-G- 
[ST]-[SC]-[LIVM], a conserved region located near the N-terminal end of aspartate kinases. 

One subtracted clone, D6 (Table 1), gave highly significant matches with fungal (Ns. 
crassa, Emericella nidulans, and S. cerevisiae) genes for 0-acetylhomoserine-(thiol)lyase 
(homocysteine synthase), and the related enzymes, cystathionine y-synthase and cystathionine (J- 
lyase, all of which are y-type pyridoxal phosphate-containing enzymes in sulfur-containing 
amino acid biosynthesis and interconversion pathways. Additionally, significant similarity was 
found with an enzyme in the biosynthesis of the bacterial compound rhizobitoxine. The 
molecular size of the transcript (between 1.5 to 2 kb, Fig. 3) predicted a protein similar in size to 
known homocysteine synthases, which are 430-450 amino acids (Sienko et aL, 1998). A related 
sequence was recently identified at a locus associated with LA production in Epichloe festucae 
(Wilkinson et al y 2000; Spiering et a/., 2000). Because these data and further evidence 
presented below associated this sequence with LA production, the corresponding gene was 
designated lolC (GenBank accessions AF461 175, AF461 176). 
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Example 4 

Association of the lolA -and loIC genes with loline production. 

Genomic sequences of loIC and one allele of lolA (clone PI .7) were obtained by using 
primers based on the cDNA of the lolA gene and genomic sequence of the loIC gene from E. 
festucae (data not shown). This information was used to design primers for specific probes and 
detection of lolA and loIC sequences in cDNA-based northern analysis of complete cDNAs from 
LA-producing and suppressed cultures (Figure 3). Both sequences were expressed in the LA- 
producing cultures. Strong hybridizing bands were detected from the complete cDNA from LA- 
producing cultures, whereas faint bands were obtained from the complete cDNA from the 
suppressed cultures. 

LA production is a trait specific to endophyte species (Christensen et al. 9 1993; Siegel et 
aL, 1990; TePaske and Powell, 1993) or even isolates within species (Wilkinson et a/., 2000). 
Consequently, we reasoned that genes associated with LA production would be present in all 
LA-producing endophytes, but might be absent from endophytes that do not produce LA. For 
many endophyte species and isolates available from our collection the LA phenotypes were 
known from the literature (Table 3), and these were confirmed by GC analyses of plants 
symbiotic with these endophytes. Additional species or isolates included in this survey were 
similarly assessed for LA production (Table 4). In Southern-blot analysis of genomic DNAs 
from two LA producers and two nonproducers, lolA and loIC sequences hybridized only with 
DNA from the endophytes that produce LA (Figure 4). The probes used to detect lolA and loIC 
did not have sites for the restriction enzyme used in the genomic digests, so for each putative 
allele one hybridizing band was expected. In N. uncinatum, two bands were observed from the 
genomic DNA probed with lolA, indicating at least two alleles of this gene; hybridization with 
loIC gave only one band, suggesting only one allele of this gene was present, but the possibility 
that this single band represented multiple alleles oflolC could not be excluded. In E. festucae, 
hybridization with the two probes gave one strong hybridizing band for each, suggesting one 
allele of each gene. The additional, fainter hybridizing bands present on the blots corresponded 
to some bands on the ethidium bromide-stained gel (not shown) and were, therefore, likely due 
to non-specific binding of the probes to mitochondrial or repetitive genomic DNA. 
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Diagnostic PCR was used with primers specific to the lolA and lolC genes for detection 
of these sequences in all species and isolates listed in Table 3. Detection of the lolA and lolC 
genes in endophytes was strictly associated with the LA-producing phenotype (Figure 5). In 
addition, the two genes were detected in N. chisosum (ATCC 64037). 

The high expression of lolA and lolC in LA-producing cultures of N. uncinatum, and the 
strict correlation of LA production with presence of the two genes in the different endophytes, 
lent strong support to involvement of the lolA and lolC genes in LA production. 
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Table 3. LA phenotype of endophyte species and isolates used in this study. Indicated are the respective grass hosts 
which were used in the determination of the LA, and from which the endophytes in this study were originally 

isolated. 



Species/isolate 9 


Grass host 


Loline 
phenotype 10 


Reference" 


l) 12 Epichloe festucae CBS 102477 


Festucae rubra 




1 


2) E.festucae CBS 102475 


N/A 13 


+ 


2 


3) E. typhina 8 


Lolium perenne 




3 


4) Neotyphodiurn aotearoae CBS 109345 


Echinopogon ovatus 


+ 


4 


5) N. aotearoae ATCC MYA-123 1 


E. ovatus 


+ 


4 


6) N australiense CBS 109346 


E. ovatus 





4 


7) M coenophiolum ATCC 90664 


Lolium arundinaceum 


+ 


3 


8) N. huerfanum ATCC 604040 


Festuca arizonica 





3 


9) N. inebrians818 


Achnatherum inebrians 





5 


10)/V. fottl38 


L. perenne 





3 


1 1) N melicicola CBS 109342 


Melica decumbens 





4 


12) At occultans 999 


Lolium multiflorum 


+ 


6 


13) At siegelii ATCC 74483 


Lolium pratense 


+ 


7 


14) Neotyphodiurn sp. 55 


Poa autumnalis 


+ 


3 


15) Neotyphodiurn sp. 87 


Festuca paradoxa 




3 


16) Neotyphodiurn sp. LpTG-2 Lpl 


L. perenne 




8 


17) Neotyphodiurn sp. 269 


Hordeum bogdanii 




4 


18) Neotyphodiurn sp. 270 


Hordeum brevisubulatum 




4 


19) Neotyphodiurn sp. 361 


Hordelymus europaeus 




9 


20) Neotyphodiurn sp. FaTG-3 Tfl8 


L. arundinaceum 


+ 


4 


21) Neotyphodiurn sp. FaTG-2 Tfl4 


L. arundinaceum 




4 


22) Neotyphodiurn sp. 4096 


Achnatherum robustum 




4 


23) N uncinatum CBS 102646 


L. pratense 


+ 


7 



9 CBS = Centraalbureau voor Schimmelculrures, Fungal Biodiversity Center, Utrecht, The Netherlands 
(http://www.cbs.knaw.nl), ATCC = American Type Culture Collection, Manassas, Virginia, USA 
(http://www.atec.org). Other designations are from the referenced papers or are laboratory isolate numbers. 

10 = »+» = loline-producing, "-"= loline non-producing. 

11 References are (1) Leuchtmann and Schardl (1998), (2) Wilkinson et al (2000), (3) Siegel et al. (1990), (4) this 
study (see Table 4), (5) Miles et al (1996), (6) TePaske and Powell (1993), (7) Craven et al (2001), (8) Christensen 
et al, (1993), (9) Leuchtmann et al, (2000). 

12 Numbers before each isolate correspond to the numbers indicated above gel lanes in Figure 6. 

13 N/A = not applicable. Isolate derived by mating E. festucae CBS 102477 with an E. festucae isolate CBS 102474 
from Lolium giganteum. 
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Table 4. LA in plants with endophyte species for which the LA phenotype was previously unknown. 



opecies/isoiaie 


Unci' 

riosi grass 


T A 1 ft W% AC ' 


Neotyphodium aotearoae CBS 109345 


Echinopogon ovatus 


1780 


N aotearoae ATCC MYA-1231 


E. ovatus 


2120 


N. australiense CBS 109346 


E. ovatus 


nd 


N. melicicola CBS 109342 


Melica decumbens 


nd 


Neotyphodium sp. FaTG-3 Tfl8 


Lolium arundinaceum 


670 


Neotyphodium sp. FaTG-2 Tfl4 


L. arundinaceum 


nd 


Neotyphodium sp. 269 


Hordeum bogdanii 


nd 


Neotyphodium sp. 270 


Hordeum brevisubulatum 


nd 


Neotyphodium sp. 270 


Stipa robusta 


nd 



14 CBS = Centraalbureau voor Schimmelcultures, Fungal Biodiversity Center, Utrecht, The Netherlands 
(http://www.cbs.knaw.nl), ATCC = American Type Culture Collection, Manassas, Virginia, USA 
(http://www.atcc.org). Other designations are from the referenced papers or are laboratory isolate numbers. 

15 Reported is the sum of yV-formyl and /V-acetyl lolines in ug g' 1 dry weight plant tissue, nd = not detected (limit of 
dection= 10 ug g' 1 ). 
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Example 5 

Additional subtracted cDNAs matching known genes and genomic sequences. 

As shown in Tables 1 & 2, several other cDNAs isolated by the subtraction method also 
gave highly significant-matches in BLAST searches of the nr and Ns. crassa databases. Matches 
included a zinc-finger transcription factor, a hypothetical protein in S cerevisiae, and a homing 
endonuclease. Additionally, matches with Ns. crassa sequences were identified for putative 
ORFs of one hybridizing library clone, P15, and one subtracted clone, A8. 

A survey of the distribution of the putative zinc-finger transcription factor among eight 
endophytes differing in LA production (four LA producers and four non-producers) was 
performed by diagnostic PCR. There was no association of this putative zinc-finger transcription 
factor gene with LA production; its presence was detected only in two isolates, one LA-producer 
(N uncinatum) and one non-producer {N. huerfanum) (data not shown). 

As further indicated in Tables 1 & 2, a number of sequences from library and subtracted 
clones gave no significant matches with known genes in the nr database and sequences in the Ns. : 
crassa genome. For one subtracted clone, N17, the full-length cDNA sequence was obtained by 
PCR, using an aliquot of the amplified cDNA library and gene and vector-specific primers. A 
predicted ORF of 363 amino acids was found within the N17 cDNA (data not shown), but this 
amino acid sequence did not give significant matches with any genes or sequences in the nr and 
Ns. crassa databases or known protein patterns in the PROSITE database. 

Example 6 

Identification of the LOL1 and LOL2 Gene Clusters 

Central to the present invention is the identification of the loline alkaloid gene clusters 
LOLl (SEQ ID NO. 15), and LOL2 (SEQ ID NO. 16) which apparently may also include lolF2 
and lolM (SEQ ID NO: "17). The association in Neotyphodium uncinatum of lolA and lolC was 
tested by long-distance-PCR. The 8.2 kb product contained the expected sequences of both, plus 
two additional open reading frames between them. We then walked outward from this fragment 
by vectorette-mediated PCR, and in the process identified two gene clusters (LOLl and LOL2 in 
Figure 6). 

In addition to lolC and lolA, at least 8 (LOLl) or 7 (LOL2) ORFs were inferred within 
LOLl and LOL2 by using a program with an algorithm for prediction of fungal genes. PCR 
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analyses/Southern hybridization on a cDNA library/total cDNA from N. uncinatum showed 
expression of the ORFs lolM y lolF, lolC, lolO t lolA, and lolE, indicating that these contain active 
genes. The details of the gene predictions and coordinates, i.e., location of the exons in the 
ORFs of LOLl and LOL2 are given below. LOLl and LOL2 differ in sequence (LOLI has -95% 
nucleotide sequence identity to LOL2), thus represent two truly distinct genomic regions. 
Altogether, ten genes were inferred in the gene clusters, with most of the genes shared between 
the clusters. PCR and Southern-blot analyses indicated that all ten genes were unique to the 
loline alkaloid producers among the isolates surveyed in Table 3 (Fung. Genet. Biol. Spiering et 
al.). Nine of the genes, lolE, lolT, lolP, lolU, lolA, lolO, lolD, lolC and lolF, were found in two 
allelic forms. 

The amino acid sequences deduced from the LOL gene ORFs gave highly significant 
matches (E< 10' 5 ; except lolU and lolM for which E>0.01) with known enzyme sequences in the 
protein databases curated by the National Center of Biotechnology Information 
( http://www.ncbi.nlm.nih.gov ; NCBI). The gene functions predicted by Genbank searches of the 
databases at NCBI, and gene orientations within the clusters, thereby indicate that LOL2 contains 
eight genes (i.e., lolE, lolT, lolP, lolU, lolA, lolO, lolD, and lolQ representing alleles of genes 
present in LOLl. LOLl contains an additional gene, named lolF t hitherto not found in LOL2. 
The genomic location of the additional ORFs, lolF2 and lolM, relative to the two LOL clusters is 
presently unknown, but we postulate that lolF2 and /o/Mare located close to LOL2. lolF2 and 
sequence adjacent to it has -93% identity to lolF (and sequence adjacent to it) in cluster LOLL 

The LOLl gene cluster spans about a 25.3 kB region and consists of 9 ORFs. Open 
reading frames of LOLl are indicated relative to nucleotide numbers annotated to SEQ ID NO: 
15; mRNA sequences of each gene are given by joined exons determined by cDNA sequencing 
or predicted by the fgenesh (Neurospora) gene prediction program at "Softberry", 
http://www.softberry.con^erry.phtml?topic=gfind); gene orientations are indicated by "+" 
(forward strand) and (reverse strand). 

ORF1 of LOLl is lolE: +strand, join 23457-24195, 24275-24306 (predicted by fgenesh at 
Softberry). 

ORF2 of LOLl is lolT: -strand, join 23003-22916, 22838-22916, 22420-22246, 22170- 
21486 (predicted by fgenesh at Softberry). 
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ORF3 of LOL1 is MP: +strand, join 19245-19554, 19639-20225, 20287-20694, 20818- 
20846, 20919-21045 (predicted by fgenesh at Softberry). 

ORF4 is lolU: -strand, join 17377-17023, 16832-15889 (predicted by fgenesh at 
Softberry). 

ORF5 of LOL1 is lolA: +strand, join 14951-15476, 15545-15648 (determined by 
sequencing of lolA cDNA). 

ORF6 of LOLl is lolO: -strand, join 13961-13770, 13781-13677 (predicted by fgenesh at 
Softberry). 

ORF7 ofLOLl is/o/D: +strand,join 10462-10588, 10945-11115, 11194-11757, 12211- 
12240, 12376-12383 (predicted by fgenesh at Softberry). 

ORF8 of LOLl is lolC: +strand, join 6903-7000, 7063-7114, 7199-7282, 7364-7723, 
7810-8364, 8435-8709 (determined by sequencing of lolC cDNA). 

ORF 9 of LOL1 is lolF: -strand, join 5095-5028, 4960-3509, 3448-3346 (predicted by 
fgenesh at Softberry). 

The LOL2 gene cluster spans about a 16.4 kB region and consists of at least 8 ORFs. It 
appears that LOL2 may include lolf2 and /o/Af (SEQ ID NO: 17) linked to the 5' end of SEQ ID 
NO: 16, in which case the LOL2 gene cluster would span about a 24kB region, consisting of 10 
ORFs (i.e., ORF1' through ORF10*). ORFs of LOL2 are indicated relative to nucleotide numbers 
annotated to SEQ ID NO: 16; mRNA sequences of each gene are given by joined exons 
determined by cDNA sequencing or predicted by the fgenesh (Neurospora) gene prediction 
program at "Softberry", http://www.softberry.com/berry.phtml?topic=gfind); gene orientations 
are indicated by "+" (forward strand) and "-" (reverse strand). 

ORF1' o(LOL2 is lolE: +strand, join 15210-15946, 16026-16057 (predicted by fgenesh 
at Softberry). 

ORF2' of LOL2 is lolT: -strand, join 14753-14666, 14588-13997, 13920-13206 (predicted 
by fgenesh at Softberry). 

ORF3' of LOL2 xslolP: +strand,join 11163-11257, 11551-11762, 11836-11925, 12000- 
12541 (predicted by fgenesh at Softberry). 

ORF4' of LOL2 is lolU: -strand, join 10438-9597, 9531-8916 (predicted by fgenesh at 
Softberry). 
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ORF5' of LOL2 is lolA: +strand, join 8006-8534, 8603-8706 (predicted by sequencing of 
lolA cDNA). 

ORF6' of LOL2 is lolO: -strand, join 7190-6999, 6907-601 1 (predicted by fgenesh at 
Softberry). 

ORF7' of LOL2 is lolD: +strand, join 3867-3993, 4103-4525, 4616-5026, 5118-5143 
(predicted by fgenesh at Softberry). 

QKSV of LOL2 is/o/C: +strand,join 873-970, 1033-1084, 1167-1250, 1334-1693, 1782- 
2335, 2406-2679 (predicted by sequencing of lolC cDNA). 

It also appears that LOL2 may include lolF2, an allele of lolF, and lolM, probably linked 
to the 5' end of LOL2 (SEQ ID NO: 16). The ORFs of lolF2 and lolM are indicated relative to 
nucleotide numbers annotated to sequence of SEQ ID NO: 17; mRNA sequences of each gene 
are given by joined exons predicted by the fgenesh (Neurospora) gene prediction program at 
"Softberry", http://www.softberry.con^erry.phtml?topic=gfind); gene orientations are indicated 
by "+" (forward strand) and (reverse strand). 

ORF9' is lolF2\ -strand, join 5804-4342, 4281-4207, 3905-3821 (predicted by fgenesh at 

Softberry). 

ORF10' is lolM: -strand, join 1689-1525, 1430-1332, 1231-1174, 1085-1021 (predicted 
by fgenesh at Softberry). 

Example 7 

Functional Assignment of the Loline Alkaloid Gene Clusters 

Most of the predicted gene products show highly significant BLAST matches (E le-7) 
with known biosynthetic enzymes and motifs. The closest BLAST matches and/or motifs of the 
ten genes follow in the order that they occur in the clusters: lolE gave a match to epoxidases; lolT 
and lolT2 matched the diagnostic domain of a-type pyridoxal phosphate (PLP)- associated 
enzymes, including class-v aminotransferases; lolP matched cytochromes P450, with closest 
relationship to pisatin demethylase from Nectria haematococca\ lolU gave no significant match 
or diagnostic motif; lolA closely matched the Asp kinase allosteric amino acid binding domain; 
lolO matched nonheme-Fe oxidoreductases, especially isopenicillin N synthase; lolD matched 
ornithine decarboxylase (an a-type PLP enzyme); lolC appeared to be a y-type PLP enzyme; lolF 
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and lolF2 appeared to encode an FAD-containing monooxygenase with closest match to 
cyclohexanone oxidase; lolM had no significant BLAST match or motif. 

Example 8 

Hybridizable Variants 

The nucleic acids of the present invention comprise at least a nucleotide sequence of all 
or part of SEQ ID NO: 15 or SEQ ID NO: 16 or variants thereof It also appears that SEQ ID 
NO: 17 or variants thereof may be part of the LOL2 gene cluster linked to the 5 f end of SEQ ID 
NO: 16, and therefore, nucleic acid sequences that hybridize to all or part of SEQ ID NO: 17 are 
also encompassed by the present invention. Variants of the present invention encode isolated 
nucleic acids that at least hybridize to all or part of SEQ ID NO. 15 or SEQ ID NO. 16 or the 
complements thereof under hybridization conditions of, at, or between, low and high stringency 
conditions, and have insecticidal activity. Low stringency conditions are generally about 3xSCC 
at about 45°C to about 65°C, and high stringency conditions are generally about O.lxSSC, 
0.1%SDS at about 65° to 68° C. Preferably, the hybridization conditions are highly stringent at 
0. lxSSC, 0. 1%SDS at 65°C. Variants are made by methods known to one of ordinary skill in the 
art and as set forth in Maniatis et al. Molecular Cloning: A Laboratory Manual (Current Edition). 
Preferably, the hybridized nucleic acids code for a polypeptide that has one or more or all of the 
physical and/or biological properties of loline alkaloids, such as insecticidal activity and feeding 
deterrent properties. 

Example 9 

Host- Vector System 

Identification and cloning of the loline alkaloid gene clusters is useful for the 
development of host- vector system for the efficient recombination production of both novel and 
known alkaloids. The coding sequences which collectively encode a loline-type alkaloid gene 
cluster, including variants, hybrids, mutants, analogs or derivatives of the loline alkaloid gene 
cluster, can be inserted into one or more expression vectors, using methods known to those of 
skill in the art. The replacement gene cluster need not correspond to the complete native loline 
alkaloid gene cluster, but need only encode a functional gene cluster to catalyze production of an 
alkaloid. 
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The recombinant vector(s) of the present invention includes replacement gene clusters 
derived from a single gene cluster, or may comprise hybrid replacement gene clusters with, e.g., 
a gene of one cluster replaced by the corresponding gene from another gene cluster. For 
example, the oxidoreductase of LOL1 may be replaced with the oxidoreductase of LOL2 without 
an effect on the product structure. Accordingly, these genes may be freely interchangeable in the 
constructs described herein. Thus, the replacement clusters of the present invention can be 
derived from any combination of alkaloid gene sets, which ultimately function to produce an 
identifiable alkaloid. 

Expression vectors also include control sequences operably linked to the desired alkaloid 
coding sequence. Suitable expression systems for use with the present invention include 
systems, which function in eucarybtic and procaryotic host cells. However, procaryotic systems 
are preferred, and in particular, systems compatible with Neotyphodium, Epichloe, Adenocarpus 
and Argyreia mollis species are of particular interest. Control elements for use in such systems 
include promoters, optionally containing operator sequences, and ribosome binding sites. 
Particularly useful promoters include control sequences derived from alkaloid gene clusters. 
However, other bacterial promoters, such as those derived from sugar metabolizing enzymes, 
such as galactose, lactose {lac) and maltose, will also find use in the present constructs. 
Additional examples include promoter sequences derived from biosynthetic enzymes such as 
tryptophan (trp), the beta-lactamase {bid) promoter system, bacteriophage lambda PL, and T5. 
In addition, synthetic promoters, such as the tac promoter, which do not occur in nature also 
function in bacterial host cells. 

Other regulatory sequences may also be desirable which allow for regulation of 
expression of the replacement gene cluster relative to the growth of the host cell. Regulatory 
sequences are known to those of skill in the art, and examples include those which cause the 
expression of a gene to be turned on or off in response to a chemical or physical stimulus, 
including the presence of a regulatory compound. Other types of regulatory elements may also 
be present in the vector, for example, enhancer sequences. 

Selectable markers can also be included in the recombinant expression vectors. A variety 
of markers are known which are useful in selecting for transformed cell lines and generally 
comprise a gene whose expression confers a selectable phenotype on transformed cells when the 
cells are grown in an appropriate selective medium. Such markers include, for example, genes 
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which confer antibiotic resistance or sensitivity to the plasmid. 

The various subunits of gene clusters of interest can be cloned into one or more 
recombinant vectors as individual cassettes, with separate control elements, or under the control 
of, e.g., a single promoter. These subunits can include flanking restriction sites to allow for the 
easy deletion and insertion of other subunits so that hybrid gene clusters can be generated. The 
design of such unique restriction sites is known to those of skill in the art and can be 
accomplished using the techniques described above, such as site-directed mutagenesis and PCR. 

Further, the vectors, which collectively encode a replacement gene cluster can be inserted 
in to one or more host cell, using methods known to those of skill in the art. As such, the present 
invention also provides host cells which have their naturally occurring gene substantially deleted, 
transformed with vectors encoding a replacement gene cluster or parts thereof, for the production 
of active alkaloids. The invention provides for the production of significant quantities of product 
at an appropriate stage of the growth cycle. The alkaloids so produced can be used as an 
insecticidal and feeding-deterrent agents to protect plants. The ability to recombinantly produce 
alkaloids also provides a powerful tool for characterizing biosynthetic enzymes and the 
mechanism of their actions. 

More particularly, host cells for the recombinant production of the subject alkaloids can 
be derived from any organism with the capability of harboring a recombinant gene cluster. Thus, 
the genetically engineered host cells of the present invention can be derived from either 
procaryotic or eucaryotic organisms. Preferably, the host may be E. coli. However, more 
preferred host cells are those constructed from the Neotyphodium species, among others, will 
provide convenient host cells for the subject invention. 

The above-described host cells are genetically engineered by deleting the naturally 
occurring loline alkaloid genes or genes encoding tailoring enzymes therefrom, using standard 
techniques, such as by homologous or heterologous recombination. One or more recombinant 
vector, collectively encoding a replacement gene cluster of the present invention, is then 
introduced into a host cell. The vector(s) can include native or hybrid combinations of loline 
alkaloid gene cluster subunits, or mutants, analogs, or derivatives thereof. Methods for 
introducing the recombinant vectors of the present invention into suitable host cells are known to 
those of skill in the art and typically include the use of CaCb or other agents, such as divalent 
cations and DMSO. DNA can also be introduced into bacterial cells by electroporation. Once 
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the genes or gene clusters are expressed, the alkaloid producing colonies can be identified and 
isolated using known techniques. The produced alkaloids can then be further characterized, e.g. 
by NMR and mass spectroscopy. 

Although illustrative embodiments of the present invention have been described in detail, 
it is to be understood that the present invention is not limited to those precise embodiments, and 
that various changes and modifications can be effected therein by one skilled in the art without 
departing from the scope and spirit of the invention as defined by the appended claims. 
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